An Improved Accuracy of Multiclass Random Forest Classifier with Continuous Attribute Transformation Using Random Percentile Generation
Authors
Abstract
This study aims to improve classification accuracy by transforming continuous attributes into categories, using randomly generated percentile values as the categorization limits. Four algorithms for generating the percentile values were compared, and one was selected based on low distribution variability and the highest revenue expectation. Accuracy on the training and testing data is the second consideration. A random forest (RF) is then modeled from the percentiles with three transformation variations. According to the results of an ANOVA test, the algorithm with the transformation variations has a mean that is not significantly different from that of the best model on the original dataset. However, on some data, the RF with attribute transformation was superior. The transformation is also very effective when applied to the LR, MLP, and NB methods. On the tuition fee dataset, these methods had accuracies of 0.178, 0.204, and 0.318, respectively, on the original attributes; the transformation gives a significant increase to 0.967, 0.949, and 0.594 for each method, respectively. On the date fruits dataset, the transformation is effective for the MLP method, which improves from 0.193 (original attributes) to 0.690 (continuous attribute transformation). The transformations are applied effectively to the MLP method and to datasets with categorical and mixed attributes.
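The core idea in the abstract, cutting a continuous attribute at randomly drawn percentiles, can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' algorithm: the abstract does not describe the four generation algorithms or the three transformation variations, so the uniform draw of percentile ranks, the linear-interpolation percentile, the function names, and the sample values are all hypothetical.

```python
import bisect
import random


def percentile(sorted_values, p):
    """Return the p-th percentile (0-100) using linear interpolation."""
    if not sorted_values:
        raise ValueError("need at least one value")
    pos = (len(sorted_values) - 1) * p / 100.0
    lo = int(pos)
    hi = min(lo + 1, len(sorted_values) - 1)
    frac = pos - lo
    return sorted_values[lo] + (sorted_values[hi] - sorted_values[lo]) * frac


def random_percentile_cuts(values, n_categories, rng):
    """Draw n_categories - 1 random percentile ranks and map them to cut points.

    Assumption: ranks are drawn uniformly from [0, 100]; the paper may use
    a different generation scheme.
    """
    ranks = sorted(rng.uniform(0.0, 100.0) for _ in range(n_categories - 1))
    ordered = sorted(values)
    return [percentile(ordered, p) for p in ranks]


def categorize(values, cuts):
    """Replace each continuous value by the index of the interval it falls in."""
    return [bisect.bisect_right(cuts, v) for v in values]


rng = random.Random(0)  # fixed seed so the sketch is reproducible
fees = [1.2, 3.4, 2.2, 8.9, 5.5, 7.1, 4.0, 6.3]  # made-up illustrative values
cuts = random_percentile_cuts(fees, n_categories=3, rng=rng)
labels = categorize(fees, cuts)
```

The resulting `labels` column would replace the continuous attribute before a classifier such as a random forest is fitted. Repeating the draw with different seeds and comparing the resulting models is one plausible way to choose among candidate cut points.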
Similar resources
An Improved Random Forest Classifier for Text Categorization
This paper proposes an improved random forest algorithm for classifying text data. This algorithm is particularly designed for analyzing very high dimensional data with multiple classes whose well-known representative data is text corpus. A novel feature weighting method and tree selection method are developed and synergistically served for making random forest framework well suited to categori...
Thresholding a Random Forest Classifier
The original Random Forest derives the final result from the number of leaf nodes that vote for each class. Each leaf node is treated equally and the class with the most votes wins. Certain leaf nodes in the topology have better classification accuracies and others often lead to a wrong decision. Also the performance of the forest for different classes differs due ...
Attribute bagging: improving accuracy of classifier ensembles by using random feature subsets
We present attribute bagging (AB), a technique for improving the accuracy and stability of classifier ensembles induced using random subsets of features. AB is a wrapper method that can be used with any learning algorithm. It establishes an appropriate attribute subset size and then randomly selects subsets of features, creating projections of the training set on which the ensemble classifiers ar...
Automated epileptic seizure detection using improved correlation-based feature selection with random forest classifier
Analysis of the electroencephalogram (EEG) signal is crucial due to its non-stationary characteristics, and it could lead the way to a proper detection method for the treatment of patients with neurological abnormalities, especially epilepsy. The performance of EEG-based epileptic seizure detection relies largely on the quality of the features selected from EEG data that characterize seizure activi...
Segmentation of retinal OCT images using a random forest classifier
Optical coherence tomography (OCT) has become one of the most common tools for diagnosis of retinal abnormalities. Both retinal morphology and layer thickness can provide important information to aid in the differential diagnosis of these abnormalities. Automatic segmentation methods are essential to providing these thickness measurements since the manual delineation of each layer is cumbersome...
Journal
Journal title: International Journal on Advanced Science, Engineering and Information Technology
Year: 2023
ISSN: 2088-5334, 2460-6952
DOI: https://doi.org/10.18517/ijaseit.13.3.18379